Chapter 1

The Convergence of the Extended Kalman Filter

Abstract 

We demonstrate that the extended Kalman filter converges locally for a broad
class of nonlinear systems. If the initial estimation error of the filter is not
too large, then the error goes to zero exponentially as time goes to infinity. To
demonstrate this, we require that the system be $C^2$ and uniformly observable
with bounded second partial derivatives.

1.1 Introduction 

The extended Kalman filter is a widely used method for estimating the state
$x(t) \in \mathbb{R}^n$ of a partially observed nonlinear dynamical system,

\[
\begin{aligned}
\dot{x} &= f(x,u) \\
y &= h(x,u) \\
x(0) &= x^0
\end{aligned}
\tag{1.1.1}
\]

from the past controls and observations $u(s) \in U \subset \mathbb{R}^m$, $y(s) \in \mathbb{R}^p$, $0 \le s \le t$,
and some information about the initial condition $x^0$. The functions $f$, $h$ are
known and assumed to be $C^2$.

An extended Kalman filter is derived by replacing (1.1.1) by its linear
approximation around the trajectory $\hat{x}(t)$ and adding standard white Gaussian
driving noise $w(t) \in \mathbb{R}^l$ and independent, standard white Gaussian
observation noise $v(t) \in \mathbb{R}^p$,

\[
\begin{aligned}
\dot{z} &= f(\hat{x}(t),u(t)) + A(t)z + Gw \\
y &= h(\hat{x}(t),u(t)) + C(t)z + v \\
z(0) &= z^0
\end{aligned}
\tag{1.1.2}
\]

where $G$ is an $n \times l$ matrix chosen by the designer,

\[
A(t) = \frac{\partial f}{\partial x}(\hat{x}(t),u(t)), \qquad
C(t) = \frac{\partial h}{\partial x}(\hat{x}(t),u(t)),
\tag{1.1.3}
\]

and $z^0$ is a Gaussian random vector independent of the noises with mean $\hat{x}^0$
and variance $P^0$ that are chosen by the designer.
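The Kalman filter for (1.1.2) is

\[
\begin{aligned}
\dot{\hat{z}}(t) &= f(\hat{x}(t),u(t)) + A(t)\hat{z}(t) + P(t)C'(t)\bigl(y(t) - h(\hat{x}(t),u(t)) - C(t)\hat{z}(t)\bigr) \\
\dot{P}(t) &= A(t)P(t) + P(t)A'(t) + \Gamma - P(t)C'(t)C(t)P(t) \\
\hat{z}(0) &= \hat{x}^0 \\
P(0) &= P^0
\end{aligned}
\tag{1.1.4}
\]

where $\Gamma = GG'$.

The extended Kalman filter for (1.1.1) is given by

\[
\begin{aligned}
\dot{\hat{x}}(t) &= f(\hat{x}(t),u(t)) + P(t)C'(t)\bigl(y(t) - h(\hat{x}(t),u(t))\bigr) \\
\dot{P}(t) &= A(t)P(t) + P(t)A'(t) + \Gamma - P(t)C'(t)C(t)P(t) \\
\hat{x}(0) &= \hat{x}^0 \\
P(0) &= P^0.
\end{aligned}
\tag{1.1.5}
\]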



Actually there are many extended Kalman filters for (1.1.1), depending on the
choice of the design parameters $G$, $\hat{x}^0$, $P^0$. We could also broaden the class
of extended Kalman filters for (1.1.1) by allowing $G = G(\hat{x}(t))$ and putting a
similar coefficient in front of the observation noise in (1.1.2). We chose not to
do so to simplify the discussion. For similar reasons we omit the discussion of
time varying systems. We expect that our main theorem can be generalized
to cover such systems. For more on the derivation of the extended Kalman
filter, see Gelb [?].
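
The filter equations (1.1.5) can be tried out numerically. The following is a minimal sketch, not taken from the source: it assumes a pendulum $\dot{x}_1 = x_2$, $\dot{x}_2 = -\sin x_1$ with noise-free measurement $y = x_1$, the design choices $G = I$ and $P^0 = I$, and explicit Euler integration. The function name `simulate_pendulum_ekf`, the step size and the initial conditions are all illustrative assumptions.

```python
import math

def simulate_pendulum_ekf(x0, xhat0, dt=1e-3, steps=20000):
    """Euler-integrate a pendulum and its extended Kalman filter (1.1.5).

    Assumptions (illustrative only): no input, y = x1 without noise,
    G = I so Gamma = I, and P(0) = I.
    Returns the list of error norms |x(t) - xhat(t)| and the final P entries.
    """
    x1, x2 = x0
    z1, z2 = xhat0                      # z stands for the estimate xhat
    p11, p12, p22 = 1.0, 0.0, 1.0       # entries of the symmetric matrix P
    errors = [math.hypot(x1 - z1, x2 - z2)]
    for _ in range(steps):
        innov = x1 - z1                 # y - h(xhat) with h(x) = x1
        a21 = -math.cos(z1)             # A(t) = [[0, 1], [a21, 0]] at the estimate
        # xhat' = f(xhat) + P C' (y - h(xhat)) with C = [1, 0]
        dz1 = z2 + p11 * innov
        dz2 = -math.sin(z1) + p12 * innov
        # P' = A P + P A' + Gamma - P C' C P, written out entry by entry
        dp11 = 2.0 * p12 + 1.0 - p11 * p11
        dp12 = p22 + a21 * p11 - p11 * p12
        dp22 = 2.0 * a21 * p12 + 1.0 - p12 * p12
        # advance the true state, the estimate and P by one Euler step
        x1, x2 = x1 + dt * x2, x2 - dt * math.sin(x1)
        z1, z2 = z1 + dt * dz1, z2 + dt * dz2
        p11, p12, p22 = p11 + dt * dp11, p12 + dt * dp12, p22 + dt * dp22
        errors.append(math.hypot(x1 - z1, x2 - z2))
    return errors, (p11, p12, p22)

errors, P_final = simulate_pendulum_ekf((1.0, 0.0), (1.2, -0.2))
print(errors[0], errors[-1])
```

With a modest initial error the printed final error should be far smaller than the initial one. This illustrates local convergence for one example only and proves nothing about other systems or larger initial errors.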

Baras, Bensoussan and James [1] have shown that under suitable conditions,
the extended Kalman filter converges locally, i.e., if the initial error
$\tilde{x}(0) = x(0) - \hat{x}(0)$ is sufficiently small then $\tilde{x}(t) = x(t) - \hat{x}(t) \to 0$ as $t \to \infty$.
Unfortunately their conditions are difficult to verify and may not be satisfied
even by an observable linear system. Krener and Duarte have given a simple
example where any extended Kalman filter fails to converge. More on these
points later.

By modifying the techniques of [1] and incorporating techniques of the
high gain observer of Gauthier, Hammouri and Othman, we shall show that
under verifiable conditions the extended Kalman filter converges locally.
To state the main result we need a definition.

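Definition 1.1.1 The system

\[
\begin{aligned}
\dot{\xi} &= f(\xi,u) \\
y &= h(\xi,u)
\end{aligned}
\tag{1.1.6}
\]

is uniformly observable for any input if there exist coordinates

\[
\{x_{ij} : i = 1,\ldots,p,\ j = 1,\ldots,l_i\}
\]

where $1 \le l_1 \le \ldots \le l_p$ and $\sum l_i = n$ such that in these coordinates the system
takes the form

\[
\begin{aligned}
y_i &= x_{i1} + \bar{h}_i(u) \\
\dot{x}_{i1} &= x_{i2} + \bar{f}_{i1}(\underline{x}_1,u) \\
&\;\;\vdots \\
\dot{x}_{ij} &= x_{ij+1} + \bar{f}_{ij}(\underline{x}_j,u) \\
&\;\;\vdots \\
\dot{x}_{il_i-1} &= x_{il_i} + \bar{f}_{il_i-1}(\underline{x}_{l_i-1},u) \\
\dot{x}_{il_i} &= \bar{f}_{il_i}(\underline{x}_{l_i},u)
\end{aligned}
\tag{1.1.7}
\]

for $i = 1,\ldots,p$ where $\underline{x}_j$ is defined by

\[
\underline{x}_j = (x_{11},\ldots,x_{1,\min\{j,l_1\}},\, x_{21},\ldots,x_{pj}).
\tag{1.1.8}
\]

Notice that in $\underline{x}_j$ the indices range over $i = 1,\ldots,p$; $k = 1,\ldots,\min\{j,l_i\}$ and
the coordinates are ordered so that the second index moves faster than the first.

We also require that each $\bar{f}_{ij}$ be Lipschitz continuous, i.e., there exists an $L$
such that for all $x, \xi \in \mathbb{R}^n$, $u \in U$,

\[
|\bar{f}_{ij}(\underline{x}_j,u) - \bar{f}_{ij}(\underline{\xi}_j,u)| \le L\,|\underline{x}_j - \underline{\xi}_j|.
\tag{1.1.9}
\]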

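The symbol $|\cdot|$ denotes the Euclidean norm.

Let

\[
A_i = \begin{bmatrix}
0 & 1 & 0 & \cdots & 0 \\
0 & 0 & 1 & \cdots & 0 \\
\vdots & & & \ddots & \vdots \\
0 & 0 & 0 & \cdots & 1 \\
0 & 0 & 0 & \cdots & 0
\end{bmatrix} \in \mathbb{R}^{l_i \times l_i},
\qquad
A = \begin{bmatrix}
A_1 & & 0 \\
 & \ddots & \\
0 & & A_p
\end{bmatrix} \in \mathbb{R}^{n \times n},
\]

\[
C_i = \begin{bmatrix} 1 & 0 & 0 & \cdots & 0 \end{bmatrix} \in \mathbb{R}^{1 \times l_i},
\qquad
C = \begin{bmatrix}
C_1 & & 0 \\
 & \ddots & \\
0 & & C_p
\end{bmatrix} \in \mathbb{R}^{p \times n},
\]

\[
\bar{f}_i(x,u) = \begin{bmatrix}
\bar{f}_{i1}(\underline{x}_1,u) \\ \vdots \\ \bar{f}_{il_i}(\underline{x}_{l_i},u)
\end{bmatrix} \in \mathbb{R}^{l_i \times 1},
\qquad
\bar{f}(x,u) = \begin{bmatrix}
\bar{f}_1(x,u) \\ \vdots \\ \bar{f}_p(x,u)
\end{bmatrix} \in \mathbb{R}^{n \times 1},
\qquad
\bar{h}(u) = \begin{bmatrix}
\bar{h}_1(u) \\ \vdots \\ \bar{h}_p(u)
\end{bmatrix} \in \mathbb{R}^{p \times 1},
\]

then (1.1.7) becomes

\[
\begin{aligned}
\dot{x} &= Ax + \bar{f}(x,u) \\
y &= Cx + \bar{h}(u)
\end{aligned}
\tag{1.1.10}
\]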

A system such as (1.1.7) or, equivalently, (1.1.10) is said to be in observable
form [?].
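
As a concrete illustration (our own, not taken from the source), a pendulum with measured angle and no input is already in observable form with $p = 1$, $l_1 = 2$, $n = 2$. In the sketch below $\bar{f}$, $\bar{h}$ denote the nonlinear terms of (1.1.10) and $\underline{x}_2 = (x_{11}, x_{12})$; unit physical constants are an assumption made for simplicity.

```latex
\[
\begin{aligned}
y &= x_{11}, \\
\dot{x}_{11} &= x_{12}, \\
\dot{x}_{12} &= -\sin x_{11},
\end{aligned}
\qquad
A = \begin{bmatrix} 0 & 1 \\ 0 & 0 \end{bmatrix}, \quad
C = \begin{bmatrix} 1 & 0 \end{bmatrix}, \quad
\bar{f}(x,u) = \begin{bmatrix} 0 \\ -\sin x_{11} \end{bmatrix}, \quad
\bar{h}(u) = 0 .
\]
% \bar{f}_{11} = 0 involves no state; \bar{f}_{12} depends only on \underline{x}_2.
\[
|\bar{f}_{12}(\underline{x}_2,u) - \bar{f}_{12}(\underline{\xi}_2,u)|
 = |\sin \xi_{11} - \sin x_{11}|
 \le |x_{11} - \xi_{11}|
 \le |\underline{x}_2 - \underline{\xi}_2| ,
\]
% so the Lipschitz condition (1.1.9) holds with L = 1.
```

The triangular dependence is what matters here: the term in the $j$-th equation of a chain may depend only on the coordinates collected in $\underline{x}_j$.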

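We shall also require that the second derivative of $\bar{f}$ is bounded, i.e., for
any $x, \xi \in \mathbb{R}^n$, $u \in U$,

\[
\left| \frac{\partial^2 \bar{f}}{\partial x_i \partial x_j}(x,u)\,\xi_i \xi_j \right| \le L\,|\xi|^2.
\tag{1.1.11}
\]

On the left we employ the convention of summing on repeated indices.
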
Theorem 1.1.1 (Main Theorem) Suppose

• the system (1.1.1) is uniformly observable for any input and so without
loss of generality we can assume that it is in the form (1.1.10) and satisfies
the Lipschitz conditions (1.1.9),

• the second derivative of $\bar{f}$ is bounded (1.1.11),

• $x(t)$, $y(t)$ are any state and output trajectories generated by (1.1.10),

• $G$ has been chosen to be invertible,

• $\hat{x}(t)$ and $P(t)$ are a solution of the extended Kalman filter (1.1.5) where
$P(0)$ is positive definite and $\tilde{x}(0) = x(0) - \hat{x}(0)$ is sufficiently small.

Then $|x(t) - \hat{x}(t)| \to 0$ exponentially as $t \to \infty$.

1.2 Proof of the Main Theorem

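We extend the method of proof of [1]. Since the system is in observable form

\[
\begin{aligned}
A(t) &= A + \tilde{A}(t) \\
C(t) &= C
\end{aligned}
\tag{1.2.1}
\]

where

\[
\tilde{A}(t) = \frac{\partial \bar{f}}{\partial x}(\hat{x}(t),u(t))
\]

and

\[
\frac{\partial \bar{f}_{ir}}{\partial x_{jk}}(\hat{x}(t),u(t)) = 0 \quad \text{if } k > r.
\]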

First we show that there exists $m_1 > 0$ such that for all $t \ge 0$

\[
P(t) \le m_1 I^{n \times n}.
\tag{1.2.2}
\]

Consider the optimal control problem of minimizing

\[
\xi'(0)P(0)\xi(0) + \int_0^t \xi'(s)\Gamma\xi(s) + \mu'(s)\mu(s)\,ds
\]

subject to

\[
\begin{aligned}
\dot{\xi}(s) &= -A'(s)\xi(s) - C'\mu(s) \\
\xi(t) &= \zeta.
\end{aligned}
\]

It is well-known that the optimal cost is

\[
\zeta' P(t) \zeta
\]

where $P(t)$ is the solution of (1.1.5).

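Following the high gain observer construction of Gauthier, Hammouri and Othman, for $\theta > 0$ we define $S(\theta)$ as the solution of

\[
A'S(\theta) + S(\theta)A - C'C = -\theta S(\theta).
\tag{1.2.3}
\]

It is not hard to see that $S(\theta)$ is positive definite for $\theta > 0$ as it satisfies the
Lyapunov equation

\[
\left(-A - \tfrac{\theta}{2} I\right)' S(\theta) + S(\theta)\left(-A - \tfrac{\theta}{2} I\right) = -C'C
\]

where $C$, $\left(-A - \tfrac{\theta}{2} I\right)$ is an observable pair and $\left(-A - \tfrac{\theta}{2} I\right)$ has all eigenvalues
equal to $-\tfrac{\theta}{2}$. It follows from (1.2.3) that

\[
S_{ij,\rho\sigma}(\theta) = \frac{S_{ij,\rho\sigma}(1)}{\theta^{j+\sigma-1}}.
\]

Let $T(\theta) = S^{-1}(\theta) > 0$; then

\[
T_{ij,\rho\sigma}(\theta) = \theta^{j+\sigma-1}\, T_{ij,\rho\sigma}(1)
\]

and $T(\theta)$ satisfies the Riccati equation

\[
-AT(\theta) - T(\theta)A' + T(\theta)C'CT(\theta) = \theta T(\theta).
\]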
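
For a single chain ($p = 1$) equation (1.2.3) can be solved entry by entry, which makes the scaling in $\theta$ easy to check. The sketch below is our own illustration and rests on that single-chain assumption: there $A$ is the $n \times n$ shift matrix and $C = [1\ 0\ \cdots\ 0]$, so (1.2.3) reads $S_{i-1,j} + S_{i,j-1} + \theta S_{ij} = (C'C)_{ij}$ with out-of-range entries taken as zero. The helper name `solve_S` is hypothetical.

```python
def solve_S(theta, n):
    """Solve A'S + SA - C'C = -theta*S for one chain of length n.

    Assumes A is the n x n shift matrix (ones on the superdiagonal) and
    C = [1, 0, ..., 0], so that (A'S)[i][j] = S[i-1][j] and
    (SA)[i][j] = S[i][j-1], using 0-based indices.
    """
    S = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            rhs = 1.0 if (i == 0 and j == 0) else 0.0   # entry of C'C
            up = S[i - 1][j] if i > 0 else 0.0          # (A'S)[i][j]
            left = S[i][j - 1] if j > 0 else 0.0        # (SA)[i][j]
            S[i][j] = (rhs - up - left) / theta
    return S

S2 = solve_S(2.0, 2)
print(S2)   # [[0.5, -0.25], [-0.25, 0.25]]

# Scaling check: with 0-based indices the exponent j + sigma - 1 is i + j + 1.
S_one, S_three = solve_S(1.0, 3), solve_S(3.0, 3)
print(all(abs(S_three[i][j] * 3.0 ** (i + j + 1) - S_one[i][j]) < 1e-9
          for i in range(3) for j in range(3)))
```

For $n = 2$ this gives $S(\theta) = \begin{bmatrix} 1/\theta & -1/\theta^2 \\ -1/\theta^2 & 2/\theta^3 \end{bmatrix}$, whose determinant $1/\theta^4$ is positive, and inverting it gives $T(\theta) = \begin{bmatrix} 2\theta & \theta^2 \\ \theta^2 & \theta^3 \end{bmatrix}$, in line with the stated powers of $\theta$.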

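We apply the suboptimal control $\mu = -CT(\theta)\xi$ to the above optimal
control problem and conclude that

\[
\zeta'P(t)\zeta \le \xi'(0)P(0)\xi(0) + \int_0^t \xi'(s)\left(\Gamma + T(\theta)C'CT(\theta)\right)\xi(s)\,ds
\tag{1.2.4}
\]

where

\[
\begin{aligned}
\dot{\xi}(s) &= \left(-A'(s) + C'CT(\theta)\right)\xi(s) \\
\xi(t) &= \zeta.
\end{aligned}
\]

Now

\[
\frac{d}{ds}\,\xi'(s)T(\theta)\xi(s) = \xi'(s)\left(\theta T(\theta) + T(\theta)C'CT(\theta)\right)\xi(s)
 - \xi'(s)\left(\tilde{A}(s)T(\theta) + T(\theta)\tilde{A}'(s)\right)\xi(s).
\]

Because of the Lipschitz condition (1.1.9) we conclude that

\[
|\tilde{A}(s)| \le L
\]

and

\[
|A(s)| \le L + |A|.
\]

From the form of $\tilde{A}(s)$ and $T(\theta)$ we conclude that

\[
\left(\tilde{A}(s)T(\theta)\right)_{ij,\rho\sigma} = O(\theta)^{j+\sigma-1}
\]

while on the other hand

\[
\theta T_{ij,\rho\sigma}(\theta) = \theta^{j+\sigma}\, T_{ij,\rho\sigma}(1).
\]

Hence we conclude that for any $\alpha > 0$ there exists $\theta$ sufficiently large so that

\[
\theta T(\theta) + T(\theta)C'CT(\theta) - \tilde{A}(s)T(\theta) - T(\theta)\tilde{A}'(s) \ge \alpha I^{n \times n}.
\]

Therefore for $0 \le s \le t$

Now there exists $m_2(\theta) > 0$ such that

\[
\xi'(s)\xi(s) \le m_2(\theta)\,\xi'(s)T(\theta)\xi(s)
\]

so we conclude that

There exist constants $m_3 > 0$, $m_4(\theta) > 0$ such that

\[
\begin{aligned}
P(0) &\le m_3 I^{n \times n} \\
\Gamma + T(\theta)C'CT(\theta) &\le m_4(\theta) I^{n \times n}.
\end{aligned}
\]

From (1.2.4) we obtain the desired conclusion,

\[
\begin{aligned}
\zeta'P(t)\zeta &\le m_3 e^{-\alpha t}\zeta'\zeta + m_4(\theta)\int_0^t e^{\alpha(s-t)}\zeta'\zeta\,ds \\
\zeta'P(t)\zeta &\le m_3 \zeta'\zeta + m_4(\theta)\int_0^t e^{\alpha(s-t)}\zeta'\zeta\,ds \\
\zeta'P(t)\zeta &\le \frac{\alpha m_3 + m_4(\theta)}{\alpha}\,\zeta'\zeta.
\end{aligned}
\]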

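Define

\[
Q(t) = P^{-1}(t)
\]

then $Q$ satisfies

\[
\begin{aligned}
\dot{Q}(t) &= -A'(t)Q(t) - Q(t)A(t) - Q(t)\Gamma Q(t) + C'C \\
Q(0) &= P^{-1}(0) > 0.
\end{aligned}
\tag{1.2.5}
\]

Next we show that there exists $m_5 > 0$ such that for all $t \ge 0$

\[
Q(t) \le m_5 I^{n \times n}.
\]

This will imply that

\[
P(t) \ge \frac{1}{m_5} I^{n \times n}.
\tag{1.2.6}
\]

Consider the optimal control problem of minimizing

\[
\xi'(0)Q(0)\xi(0) + \int_0^t \xi'(s)C'C\xi(s) + \mu'(s)\mu(s)\,ds
\]

subject to

\[
\begin{aligned}
\dot{\xi}(s) &= A(s)\xi(s) + G\mu(s) \\
\xi(t) &= \zeta.
\end{aligned}
\]

It is well-known that the optimal cost is

\[
\zeta' Q(t) \zeta
\]

where $Q(t)$ is the solution of (1.2.5).
We use the suboptimal control

\[
\mu(s) = G^{-1}\left(\alpha I^{n \times n} - A(s)\right)\xi(s)
\]

so that the closed loop dynamics is

\[
\begin{aligned}
\dot{\xi}(s) &= \alpha\xi(s) \\
\xi(t) &= \zeta.
\end{aligned}
\]

From this we obtain the desired bound

\[
\begin{aligned}
\zeta'Q(t)\zeta &\le \xi'(0)Q(0)\xi(0)
 + \int_0^t \xi'(s)\left(C'C + (\alpha I^{n \times n} - A'(s))\Gamma^{-1}(\alpha I^{n \times n} - A(s))\right)\xi(s)\,ds \\
\zeta'Q(t)\zeta &\le e^{-2\alpha t}\zeta'Q(0)\zeta + \int_0^t e^{2\alpha(s-t)}\zeta'\left(C'C + (\alpha + L)^2\Gamma^{-1}\right)\zeta\,ds \\
\zeta'Q(t)\zeta &\le \left(m_6 + \frac{m_6}{2\alpha}\right)\zeta'\zeta
\end{aligned}
\]

where

\[
\begin{aligned}
Q(0) &\le m_6 I^{n \times n} \\
C'C + (\alpha + L)^2\Gamma^{-1} &\le m_6 I^{n \times n}.
\end{aligned}
\]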

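Now let $x(t)$, $u(t)$, $y(t)$ be a trajectory of the system (1.1.10) starting at
$x^0$. Let $\hat{x}(t)$ be the trajectory of the extended Kalman filter (1.1.5) starting
at $\hat{x}^0$ and $\tilde{x}(t) = x(t) - \hat{x}(t)$, $\tilde{x}^0 = x^0 - \hat{x}^0$. Then

\[
\begin{aligned}
\frac{d}{dt}\,\tilde{x}'(t)Q(t)\tilde{x}(t) = {} & -\tilde{x}'(t)\left(C'C + Q(t)\Gamma Q(t)\right)\tilde{x}(t) \\
 & + 2\tilde{x}'(t)Q(t)\left(\bar{f}(x(t),u(t)) - \bar{f}(\hat{x}(t),u(t)) - \tilde{A}(t)\tilde{x}(t)\right).
\end{aligned}
\]

Now, following [1],

\[
\bar{f}(x(t),u(t)) - \bar{f}(\hat{x}(t),u(t)) - \tilde{A}(t)\tilde{x}(t)
 = \int_0^1\!\!\int_0^1 r\,\frac{\partial^2 \bar{f}}{\partial x_i \partial x_j}\bigl(\hat{x}(t) + rs\,\tilde{x}(t),u(t)\bigr)\,\tilde{x}_i(t)\tilde{x}_j(t)\,ds\,dr
\]

and, since $\int_0^1\!\int_0^1 r\,ds\,dr = \tfrac{1}{2}$, the bound (1.1.11) gives

\[
\left| \int_0^1\!\!\int_0^1 r\,\frac{\partial^2 \bar{f}}{\partial x_i \partial x_j}\bigl(\hat{x}(t) + rs\,\tilde{x}(t),u(t)\bigr)\,\tilde{x}_i(t)\tilde{x}_j(t)\,ds\,dr \right|
 \le \frac{L}{2}\,|\tilde{x}(t)|^2 \le L\,|\tilde{x}(t)|^2.
\]

Since $G$ is invertible there exists $m_7 > 0$ such that

\[
\Gamma \ge m_7 I^{n \times n}
\]

and so, using (1.2.2) and the bound $Q(t) \le m_5 I^{n \times n}$,

\[
\begin{aligned}
\frac{d}{dt}\,\tilde{x}'(t)Q(t)\tilde{x}(t) &\le -\frac{m_7}{m_1^2}\,|\tilde{x}(t)|^2 + m_5 L\,|\tilde{x}(t)|^3 \\
 &\le -\frac{m_7}{m_1^2 m_5}\,\tilde{x}'(t)Q(t)\tilde{x}(t) + m_5 L \left(m_1\,\tilde{x}'(t)Q(t)\tilde{x}(t)\right)^{3/2}.
\end{aligned}
\]

If

\[
\left(\tilde{x}'(t)Q(t)\tilde{x}(t)\right)^{1/2} \le \frac{m_7}{2 m_1^{7/2} m_5^2 L}
\]

then

\[
\frac{d}{dt}\,\tilde{x}'(t)Q(t)\tilde{x}(t) \le -\frac{m_7}{2 m_1^2 m_5}\,\tilde{x}'(t)Q(t)\tilde{x}(t)
\]

so $\tilde{x}'(t)Q(t)\tilde{x}(t) \to 0$ exponentially as $t \to \infty$. Therefore if

\[
\left(\tilde{x}'(0)Q(0)\tilde{x}(0)\right)^{1/2} \le \frac{m_7}{2 m_1^{7/2} m_5^2 L}
\]

the extended Kalman filter converges.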



1.3 Conclusions 

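The above result does not follow from that of Baras, Bensoussan and James
[1]. To show local convergence of the extended Kalman filter they required
"uniform detectability". They define this as follows. The system

\[
\begin{aligned}
\dot{x} &= f(x) \\
y &= Cx
\end{aligned}
\tag{1.3.1}
\]

is uniformly detectable if there exists a bounded Borel matrix-valued function
$\Lambda(x)$ and a constant $\alpha > 0$ such that for all $x, \xi \in \mathbb{R}^n$

\[
\xi'\left(\frac{\partial f}{\partial x}(x) + \Lambda(x)C\right)\xi \le -\alpha\,|\xi|^2.
\]

This is a fairly restrictive condition as not all observable linear systems are
uniformly detectable. Consider

\[
\begin{aligned}
\dot{x} &= \begin{bmatrix} 0 & 1 \\ a_1 & a_2 \end{bmatrix} x \\
y &= \begin{bmatrix} 1 & 0 \end{bmatrix} x.
\end{aligned}
\]

Suppose

\[
\Lambda(x) = \begin{bmatrix} \Lambda_1(x) \\ \Lambda_2(x) \end{bmatrix}
\]

then

\[
\frac{\partial f}{\partial x}(x) + \Lambda(x)C = \begin{bmatrix} \Lambda_1(x) & 1 \\ \Lambda_2(x) + a_1 & a_2 \end{bmatrix}.
\]

If $a_2 > 0$ and $\xi' = \begin{bmatrix} 0 & 1 \end{bmatrix}$ then

\[
\xi'\left(\frac{\partial f}{\partial x}(x) + \Lambda(x)C\right)\xi = a_2 > 0
\]

so the system is not uniformly detectable. This system does satisfy the
conditions of Theorem 1.1.1, so an extended Kalman filter would converge
locally. Since the system is linear, an extended Kalman filter is also a Kalman
filter that converges globally.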
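
The obstruction in this linear example is that $\Lambda(x)C$ only alters the first column of the matrix, so its $(2,2)$ entry $a_2$ cannot be changed. A small numerical check, given only as an illustrative sketch with the hypothetical helper `quad_form` and arbitrarily chosen values $a_1 = -1$, $a_2 = 0.5$:

```python
import random

def quad_form(a1, a2, lam1, lam2, xi):
    """Return xi' (A + Lambda C) xi for A = [[0, 1], [a1, a2]] and C = [1, 0]."""
    M = [[lam1, 1.0], [lam2 + a1, a2]]     # A + Lambda C, only column 1 changes
    return sum(xi[i] * M[i][j] * xi[j] for i in range(2) for j in range(2))

random.seed(0)
values = [quad_form(-1.0, 0.5,
                    random.uniform(-100.0, 100.0),
                    random.uniform(-100.0, 100.0),
                    (0.0, 1.0))
          for _ in range(5)]
print(values)   # every entry equals a2 = 0.5, whatever Lambda is
```

Whatever gain is drawn, the quadratic form along $\xi' = [0\ 1]$ stays at $a_2 > 0$, so the uniform detectability inequality cannot hold.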

The example of Krener and Duarte mentioned in the introduction, a highly nonlinear problem where an EKF may fail to
converge, is

\[
\begin{aligned}
\dot{x} &= f(x) = x(1 - x^2) \\
y &= h(x) = x^2 - x/2
\end{aligned}
\tag{1.3.2}
\]

where $x, y \in \mathbb{R}$. The system is observable as $y, \dot{y}, \ddot{y}$ separate points but it is
not uniformly observable. The dynamics has two stable equilibria at $x = \pm 1$
and an unstable equilibrium at $x = 0$. Under certain initial conditions, the
extended Kalman filter fails to converge. Suppose $x^0 = 1$ so $x(t) = 1$ and
$y(t) = 1/2$ for all $t \ge 0$. But $h(-1/2) = 1/2$ so if $\hat{x}^0 \le -1/2$ the extended
Kalman filter will not converge. To see this notice that when $\hat{x}(t) = -1/2$,
the term $y(t) - h(\hat{x}(t)) = 0$ so $\dot{\hat{x}} = f(\hat{x}(t)) = f(-1/2) = -3/8$. Therefore
$\hat{x}(t) \le -1/2$ for all $t \ge 0$.
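
The failure can be reproduced numerically. The following sketch is our own illustration of example (1.3.2): it integrates the scalar extended Kalman filter with explicit Euler steps, assuming the design choices $\Gamma = 1$ and $P^0 = 1$, a noise-free output and a step size picked for convenience. The function name `simulate_counterexample` is hypothetical.

```python
def simulate_counterexample(xhat0, p0=1.0, gamma=1.0, dt=1e-3, steps=20000):
    """Scalar EKF for x' = x(1 - x^2), y = x^2 - x/2, true state fixed at x = 1.

    Assumed design choices (illustrative only): Gamma = gamma, P(0) = p0.
    Returns the list of estimates xhat at every Euler step.
    """
    f = lambda x: x * (1.0 - x * x)
    h = lambda x: x * x - 0.5 * x
    x, xhat, p = 1.0, xhat0, p0
    path = [xhat]
    for _ in range(steps):
        y = h(x)                          # stays at 1/2 because x = 1 is an equilibrium
        a = 1.0 - 3.0 * xhat * xhat       # A(t) = f'(xhat)
        c = 2.0 * xhat - 0.5              # C(t) = h'(xhat)
        dxhat = f(xhat) + p * c * (y - h(xhat))
        dp = 2.0 * a * p + gamma - p * p * c * c
        x += dt * f(x)
        xhat += dt * dxhat
        p += dt * dp
        path.append(xhat)
    return path

stuck = simulate_counterexample(-1.0)     # starts below -1/2
good = simulate_counterexample(0.5)       # starts between 0 and 1
print(max(stuck), good[-1])
```

Started at $\hat{x}^0 = -1$ the estimate should never rise above $-1/2$, so the error stays larger than $3/2$, whereas started at $\hat{x}^0 = 1/2$ it should approach the true state $x = 1$.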



1.4 Dedication 



This paper is dedicated to my esteemed colleague and good friend, Professor
Anders Lindquist, on the occasion of his 60th birthday.